Picture for Xilin Chen

Xilin Chen

Dynamic Attention Analysis for Backdoor Detection in Text-to-Image Diffusion Models

Add code
Apr 29, 2025
Viaarxiv icon

DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks

Add code
Apr 24, 2025
Viaarxiv icon

EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models

Add code
Mar 26, 2025
Viaarxiv icon

REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models

Add code
Mar 20, 2025
Viaarxiv icon

OpenEarthSensing: Large-Scale Fine-Grained Benchmark for Open-World Remote Sensing

Add code
Feb 28, 2025
Viaarxiv icon

MATS: An Audio Language Model under Text-only Supervision

Add code
Feb 20, 2025
Viaarxiv icon

Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation

Add code
Jan 08, 2025
Viaarxiv icon

M$^3$oralBench: A MultiModal Moral Benchmark for LVLMs

Add code
Dec 30, 2024
Viaarxiv icon

Multi-P$^2$A: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models

Add code
Dec 27, 2024
Viaarxiv icon

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

Add code
Nov 25, 2024
Viaarxiv icon